A segmentation-free approach to recognise printed Sinhala script using linear symmetry

نویسندگان

  • H. L. Premaratne
  • Josef Bigün
چکیده

In this paper, a novel approach for printed character recognition using linear symmetry is proposed. When the conventional character recognition methods such as the arti1cial neural network based techniques are used to recognise Brahmi Sinhala script, segmentation of modi1ed characters into modi1er symbols and basic characters is a necessity but a complex issue. The large size of the character set makes the whole recognition process even more complex. In contrast, in the proposed method, the orientation features are e7ectively used to recognise characters directly using a standard alphabet as the basis without the need for segmentation into basic components. The edge detection algorithm using linear symmetry recognises vertical modi1ers. The linear symmetry principle is also used to determine the skew angle. Experiments with the aim for an optical character recognition system for the printed Sinhala script show favourable results. ? 2004 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Segmentation-free Approach to Recognise Printed Sinhala Script

Majority of character recognition algorithms such as the use of ANNs needs segmentation of the script prior to recognition. Contrast to Western scripts, Brahmi descended South Asian scripts such as Sinhala consist of modifier symbols, which make the segmentation a difficult task that needs to be addressed as a separate issue. Further, the change of shape of the basic character (by violating mod...

متن کامل

Lexicon and hidden Markov model-based optimisation of the recognised Sinhala script

The Brahmi descended Sinhala script is used by 75% of the 18 million population in Sri Lanka. To the best of our knowledge, none of the Brahmi descended scripts used by hundreds of millions of people in South Asia, possess commercial OCR products. In the process of implementation of an OCR system for the printed Sinhala script which is easily adoptable to similar scripts [Premaratne, L., Assabi...

متن کامل

Recognition of Printed Sinhala Characters Using Linear Symmetry

Sinhala characters used in the Sinhala script by over 70% of the 18 million population in Sri Lanka, have been descended from the ancient Brahmi script. The Sinhala alphabet consists of vowels and consonants and the consonants are modified using modifier symbols to give the required vocal sounds. In the process of developing an OCR for the Sinhala script, characters are initially recognised thr...

متن کامل

Recognition of Modification-based Scripts Using Direction Tensors

The research on the OCR technology for the Latinbased scripts has been successful in achieving the status of image scanners with built-in OCR facility. But, a majority of modification-based scripts such as Brahmi descended South Asian or Ethiopic scripts are still progressing to achieve this status. This indicates the difficulties in adopting the recognition methods that have been proposed so f...

متن کامل

A Neural Network Based Character Recognition System for Sinhala Script

Much effort has been extended in making a computer recognise both typed and handwritten characters automatically. Until quite recently, the focus of this endeavour has been on characters of English Language. As for Asian languages such as Sinhala and Tamil, little or no attention has been given. Methods currently widely used for character recognition for these languages are mainly those which i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pattern Recognition

دوره 37  شماره 

صفحات  -

تاریخ انتشار 2004